Epoch Extraction of Voiced Speech
Abstract
A general theory of epoch extraction of overlapping nonidentical waveforms is presented. The theory is applied to the outputs of models of the voiced speech production mechanism and to actual speech data. Some typical glottal waveshapes are considered to explain their effect on the speech output. It is shown that the points of excitation of the vocal tract can be precisely identified in continuous speech. The method yields accurate pitch information even for high-pitched sounds. Epoch extraction has wide applications in speech analysis, speaker verification, speech synthesis, and pitch perception studies.
Similar resources
Time-Order Representation Based Method for Epoch Detection from Speech Signals
Epochs in voiced speech are defined as the time instants of significant excitation of the vocal tract system during the production of speech. The nonstationary nature of the excitation source and the vocal tract system makes accurate identification of epochs a difficult task. Most of the existing methods for epoch detection require prior knowledge of voiced regions and a rough estimation of pitch f...
Parameterizing Speech Phonemes by Exponential Sinusoidal Model
The exponential sinusoidal model (ESM) for parameterizing speech signals is proposed in this paper. The main feature of the ESM is that the amplitude of each sinusoidal component is allowed to vary exponentially with time. A novel variable segmentation strategy is applied first to separate individual transients of a voiced phoneme, which can then be fitted with the ESM. The epoch of a transient...
Analysis of instantaneous F0 contours from two speakers mixed signal using zero frequency filtering
Instantaneous fundamental frequency (F0) in voiced speech can be obtained from the sequence of epochs corresponding to the instants of significant excitation. The epoch sequence can be derived using the recently proposed epoch extraction method based on zero frequency filtering. The epoch extraction method is robust against additive noise degradation. But in a multispeaker mixed signal, the deg...
Uniform concatenative excitation model for synthesising speech without voiced/unvoiced classification
In general, speech synthesis using the source-filter model of speech production requires the classification of speech into two classes (voiced and unvoiced) which is prone to errors. For voiced speech, the input of the synthesis filter is an approximately periodic excitation, whereas it is a noise signal for unvoiced. This paper proposes an excitation model which can be used to synthesise both ...
Automatic detection of creaky voice using epoch parameters
This paper proposes a method based on epoch parameters for the detection of creaky voice in speech signals. The epoch parameters considered in this work, which characterize the source of excitation, are the number of epochs in a frame, the strength of excitation of the epochs, and the epoch intervals. Analysis of epoch parameters estimated from the zero-frequency filtering method with different window sizes is carried out. Di...
Publication date: 2009